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Abstract 

It  is  known  that  observations  from  any  censored  life  (or  competing 
risk)  process  can  be  described  by  a  random  censoring  (or  indepen¬ 
dent  risk)  model.  On  the  other  hand  it  is  impossible  to  verify  that 
the  censoring  is  really  random.  A  class  of  age-dependent  censoring 
processes  are  defined  and  the  class  of  corresponding  subsurvival  func¬ 
tions  is  described.  Exponential  life  variables  censored  by  independent 
variables  can  also  be  described  by  age-dependent  censoring  models  if 
the  censoring  variable  is  DFR.  The  total  time  on  test  statistic  consis¬ 
tently  estimates  the  expected  life  of  an  exponential  fife  variable  under 
random  right  censoring.  If  the  censoring  is  age-dependent,  the  Total 
Time  on  Test  statistic  will  severly  overestimate  the  expected  life  of  the 
variable  of  interest.  Care  should  be  taken  to  motivate  the  application 
of  the  total  time  on  test  statistic  in  such  situations. 
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1  Introduction 

For  exponential  reliability  life  variables  subjected  to  random  right  censoring, 
the  Total  Time  on  Test  statistic 


total  time  on  test  ,  „ 

TTT  - -  m 

#  of  failures  K  ) 

is  the  inverse  maximum  likelihood  estimator  of  the  failure  rate.  It  is  widely 
used  in  reliability  analysis  to  estimate  the  expectation  and  failure  rate  of  the 
uncensored  life  process. 

In  practice  it  is  impossible  to  determine  whether  the  censoring  is  really 
random  or  whether  the  uncensored  life  process  is  really  exponential1.  In  many 
situations,  other  assumptions  would  also  be  prime  facie  plausible.  In  practice, 
analysts  regard  the  assumptions  underlying  the  TTT  as  idealizations,  and 
expect  that  the  error  caused  by  using  these  idealizations  will  be  small. 

This  paper  shows  that  the  idealization  is  not  harmless.  Two  classes  of 
censoring  models  for  right  censored  life  variables  are  considered.  The  first 
class  is  random  right  censoring.  The  second  class  assumes  that  whether  a  life 
variable  is  censored  is  independent  of  its  age,  but  given  that  it  is  censored,  the 
censoring  time  may  depend  on  age.  These  are  termed  age-dependent  censor¬ 
ing  models.  This  type  of  coupling  between  age  and  censoring  time  has  prime 
facie  plausibility  in  many  situations.  Suppose  a  reliability  component  shows 
symptoms  prior  to  failure.  If  these  symptoms  are  observed  by  personnel, 
the  component  will  be  taken  off  line  and  repaired,  hence  censored.  However, 
the  process  of  observation  may  be  described  by  a  random  variable  which  is 
independent  of  the  component’s  age.  In  medical  cohort  studies,  patients  who 
feel  healthy  after  a  treatment  might  be  more  likely  to  move  to  another  city, 
thus  causing  a  censored  observation,  whereas  others  tend  to  stay  put. 

It  has  been  known  at  least  since  Cox  [4]  that  any  competing  risk  or 
censored  life  process  can  be  described  by  an  independent  model  (see  also  [5], 
[6],  [10].)  These  authors  draw  attention  to  the  problem  of ’’identifiability”, 
namely  that  the  empirical  failure  data  do  not  determine  a  unique  model. 
Empirical  failure  data  determine  at  most  a  pair  of  subsurvival  functions. 

‘Tests  for  exponentiality  assuming  random  censoring  can  be  given  (4) 
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This  paper  describes  the  set  of  subsurvival  functions  which  can  be  de¬ 
scribed  by  age-dependent  censoring.  A  necessary  and  sufficient  condition  is 
that  the  conditional  subsurvival  function  of  the  life  variable  should  domi¬ 
nate  the  conditional  subsurvival  function  of  the  censoring  variable.  If  the  life 
variable  is  exponential  and  if  the  censoring  is  random,  then  the  conditional 
subsurvival  function  of  the  life  variable  dominates  that  of  the  censoring  vari¬ 
able  if  the  latter  has  a  decreasing  failure  rate  (DFR).  The  dominance  relation 
is  reversed  if  the  censoring  variable  has  an  increasing  failure  rate  (IFR).  Cases 
in  which  the  censoring  variables  are  also  exponentially  distributed  lie  on  the 
boundary  between  these  two  subsets.  A  concluding  section  points  out  that 
the  consequences  of  choosing  the  wrong  model  can  be  severe,  when  estimating 
the  expectation  ofs  the  life  variable.  This  is  especially  true  when  censoring 
is  frequent,  as  is  usually  the  case  in  reliability  applications.  A  ’’dual”  model 
in  which  the  roles  of  the  life  and  censoring  variables  is  reversed,  is  discussed 
briefly. 


2  Notation  and  Definitions 


Let  X  and  Z  be  positive  random  varibles  with  distribution  functions  Fx  and 
Fz .  XLZ  says  that  X  and  Z  are  independent,  X  ~  Z  says  that  F\  =  Fz 
and  X  A  Z  —  min{A,  Z}.  Sx  =  1  —  Fx  is  the  survival  function  of  X.  Sx(t) 
=P{X  >  t  and  X  <  Z}  is  the  subsurvival  function.  Note  that  S x  depends 
on  Z,  although  this  fact  is  suppressed  in  the  notation.  Note  also  that  5^(0) 
=P{X  <  Z}.  We  assume  that  Y  =  {min{X,  Zj,  l{x<zy}  is  observable, 
where  1{.}  is  the  indicator  function  of  the  event  {•}.  A'  is  considered  the 
variable  of  interest,  and  Z  is  the  censoring  variable.  By  observing  Y  we 
observe  the  smallest  of  these  two,  and  observe  which  it  is.  It  is  convenient  to 
speak  of  variables  A',  as  components  and  of  variables  Z;  as  censoring  times. 
If  Zi  <  Xi  we  say  the  component  i  is  censored.  The  value  assigned  to  Z, 
when  Z,  >  A,  is  arbitrary. 


no 


3.  S?(0)  =  1  -  S2*(0). 

Let  Yi\i  =  1, 2,  •  •  •  be  i.i.d.  copies  of  Y,  and  Y  =  Y\,  Y2,  •  •  •.  The  subsur¬ 
vival  pair  Sx  and  S*z  determine  the  distribution  of  Y  and  hence  contains  all 
the  information  which  can  be  extracted  from  observing  Y. 


3  Two  Classes  of  Models 

We  study  two  classes  of  models  for  Y. 

•  Class  I:  random  right  censoring:  XA.Z 

•  Class  II:  age-dependent  censoring:  Z  =  X  —  £;  X  -Lsgn£ 

In  age-dependent  censoring  the  probability  of  a  component  being  censored 
is  independent  of  its  age,  but  given  that  it  is  censored,  the  time  at  which  it 
is  censored  may  depend  on  its  age.  Putting  o  =  (X  —  £)/X,  we  can  also 
express  an  age  dependent  censoring  variable  as  Z  =  Xa. 

In  this  section  we  determine  under  what  circumstances  right  censored  life 
data  can  distinguish  between  these  two  model  classes.  The  following  theorem 
has  been  in  the  folklore  for  many  years  (see  ,  [1],  [3], [4],  [6],  [7],  [10]  ),  but  a 
proof  sketch  is  provided  for  completeness2. 

Theorem  1  Let  X  and  Z  be  continuous  life  variables. 

1.  If  X±Z.  and  Xf±Z'  with  Sx  =  S*,  and  Sz  =  S*z,,  then  X  ~  X'  and 
Z  ~  Z'  . 

2.  if  Si  and  S[  are  a  subsurvival  pair  and  are  continuous,  then  there 
exist  independent  random  variables  X  and  Z  such  that  S *x  =  S']1  and 
Sz  —  S2- 

2Most  authors  show  that  any  censored  life  process  can  be  modeled  by  a  process  for 
which  P{X  >  t  fl  Z  >  i}  =  P{X  >  t}P{Z  >  t),  which  is  implied  by  but  not  equivalent 
with  independence  [6].  The  second  statement  in  Theorem  1  is  proved  by  Tsiatis  [10];  the 
version  here  is  simplified  by  the  restriction  to  two  "competing  risks”. 
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Proof:  1:  Using  SxSz  =  Sx  +  S%  and  dSx(t)  =  Sz{t)dSx(t ); 


Sx(t)  =  exp 


Sz(t )  =  exp 


(  [*  dSx(u)  1 

Vo  5^(n)  +  5M«)/’ 

(  rl  dSz(u)  \ 

V°  Sx(u)  +  Sz(u)  J 


(2) 

(3) 


Similar  expressions  hold  for  X'  and  Z'.  For  2,  choose  X±Z  with  survival 
functions 


Sx(t)  =  exp 


Sz(t)  =  exp 


r  ^  dsjju)  ] 

\Jo  St(u)  +  S2*(u)  j  ’ 

r  rt  dsj{u)  1 

Vo  Sj(u)  4-  S2(u)  J  ' 


(4) 

(5) 


From  1  we  know  that  these  survival  functions  can  also  be  written  as  (2)  and 
(3).  Thus  we  have 

dS{  _  S;  +  S2  _  dS£ 
dS}  ”  +S*Z  ~  dS*z 

Equating  the  product  of  equations  (2)  and  (3)  with  the  product  of  (4)  and 
(5): 


d(Sj  +  S*2)  ^  d(S*x  +  Si) 
S{  +  S2  Sx  +  S*z 

so  that  ln(5i  +  S2)  =  ln(S^  +  S*z),  or 

S[  +  S2  _  j  _  _  dS£- 

Sx  4*  d52  d5i 


(6) 


(7) 


Since  limt-o^SJ  (f)  -  S*(f))  =  0,  and  it  follows  that  Sf  =  Sx  and  similarly 
for  S’z.  □ 


From  Theorem  1,  any  continuous  subsurvival  pair  defines  a  unique  model 
of  class  I.  To  get  this  type  of  result  for  class  II  we  must  restrict  to  continuous 
strictly  monotonic  subsurvival  pairs. 
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Theorem  2  Let  {Si ,  S2}  be  a  pair  of  continuous  strictly  monotonic  subsur¬ 
vival  functions;  then  the  following  are  equivalent: 

1.  There  exist  random  variables  £  and  X  with  sgn(£)LX  such  that 


s;(t)  =  p{x  >m(<o} 
s;(t)  =  p{x  -  i  >  t  n  e  >  0} 


(8) 

(9) 


2.  For  allt  >  0 

S\{t)  Sj(t) 

Sf(0)  s;(0) 

Proof:  (1)  implies  (2).  Since  Xlsjn({): 


(10) 


s;(()  =  p{x  >  <|€  <  o}p{«  <  o}  =  P{x  >  t|f  <  o}s;<o)  =  p{x  >  t|f  >  o}s;(o) 

Si(t)  =  P{X  -  (  >  f|{  >  0}P{*  >  0)  =  P{X  -  (  >  t\(  >  0}S|(0). 


Hence  for  t  >  0; 


§M  =  p{A>*>0}>p{x-<>«>0  ).m 


(2)  implies  (1):  Let  X  and  Z  be  random  variables  with  survival  functions 
Sx(t)  =  Si{t)/Si( 0);  and  Sz(t)  =  S2*(t)/S2*(0).  Then  Sz{t)  <  Sx(t)  for 
all  t  >  0,  and  S*1  and  Sj1  exist.  Choose  a  random  variable  &±.X\  with 
P{6  =  1}  =  1  -  P{6  =  0}  =  S2(0);  and  put: 


f  =  «(X  -  Si'Sx(X))  -  (1  -  «)  (11) 
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We  have  {£  >  0}  =  {<5  =  1};  {£  =  —1}  =  {6  =  0}.  Hence  X±sgn(£)  and 


P{X-£>t  n£>0}  =  P{X-£  >  <|f  >  0}S2*(0)  = 
P{S?Sx(X)  >  t}S*2(0)  =  P{X>SxlSz(t)}SZ( 0)  = 

(sxs^sz)(t)s;(0)  =  sz(t)s'2(0)  =  s;(t), 

Also  P{X  >  t  n  i  <  0}  =  Sx{t)P{£  <  0}  =  5^(05! (0)  =  SJ(t).D 

Observations  of  a  right  censored  process  Y  yield  an  empirical  subsur¬ 
vival  pair  which  converges  to  S*x  and  S*z.  These  in  turn  express  all  we  can 
learn  from  the  empirical  process.  Theorem  (1)  says  that  a  unique  random 
censoring  model  will  always  describe  the  process.  Theorem  (2)  says  that 
an  age-dependent  censoring  model  will  describe  the  process  if  (10)  holds. 
Uniqueness  in  this  case  is  impossible  as  the  distribution  of  £  given  £  <  0  is 
undetermined. 

Of  course,  if  the  inequality  in  (10)  were  reversed,  then  a  model  could 
be  found  in  which  the  roles  of  X  and  Z  were  reversed.  In  other  words  the 
sign  of  £  would  be  independent  of  the  censoring  variable  Z.  Such  models 
would  be  difficult  to  interprete  in  a  reliability  context  as  it  is  difficult  to 
assign  meaning  to  Z  when  X  <  Z.  However,  in  medical  cohort  studies  this 
type  of  model  might  make  sense.  Typically  the  end  date  of  such  studies  is 
fixed  in  advance,  but  patients  are  admitted  into  the  study  at  random  times, 
after  undergoing  some  treatment.  The  variable  Z,  denotes  the  amount  of 
time  that  patient  i  is  potentially  under  observation,  that  is,  the  difference 
between  the  study  termination  date  and  V s  entrance  date.  Let  X,-  denote 
the  time  after  treatement  at  which  i  dies.  It  is  plausible  that  whether  i 
responds  to  treatment,  and  hence  does  not  die  within  the  time  of  the  study, 
is  independent  of  the  date  at  which  i  entered  the  study.  Hence  we  could 
model  Xi  =  Z,  —  £  with  Z,±sgn(^).  Observations  fitting  this  pattern  are 
reported  in  [9]. 

The  following  theorems  relate  class  I  to  the  assumptions  of  exponentiality 
for  X  and/or  Z. 

Theorem  3  Let  X1.Z  with  continuous  strictly  monotonic  distribution  func¬ 
tions.  Then  any  two  of  the  following  statements  imply  the  others 
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1.  Sx(t)  =  exp  A* 

2.  S'x(t)  =  jA-exp-t^)* 

5.  5M0  =  *exp-^ 

Sz(f)  =  exp_7< 

Proof:  For  {1,2}  =»  3  and  {2,3}  =>•  1,  substitute  the  information  given  into 
equations  (2)  and  (3).  {1,4}  =>•  2  is  a  direct  computation.  For  {1,3}  =>  4, 
put  fz(u)  =  —dSz(u)/du,  then 

S*z(t)  =  [  exp_Au  fz{u)du  =  — 1~—  exp~(A+7)t 
Jt  A  +  7 

Taking  derivatives  of  both  sides  yields  fz(t)  =  7exp“7',  which  yields  4.  □ 
Corollary  If  X  J_  Z  and  Sx(t)  =  exp~A<,  Sz(t)  =  exp-7*,  then 

^z(f)  _  $x(t) 

5J(0)  S-x(  0) 


The  above  property  does  not  uniquely  characterize  the  exponential  dis¬ 
tribution  [5], 

Theorem  4  Let  X±Z  and  suppose  that  there  exists  a  A  such  that  for  all 
T  >  0: 

P{X  <T  O  X  <  Z}  =  A  E(X  A  Z)1{XAZ<T};  (12) 

then  Sx(t)  =  exp~A*.  Conversely,  if  X±Z  and  Sx{t )  =  exp-At,  then  (12) 
holds. 
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Proof:  Using  Sxaz  =  SxSz  =  Sx  +  S*z,  and  E(X  A  Z)1{xaz<t}  =  Jo  ($x  + 
S*z ),  we  have: 


P{X  <TfU  <  Z) 
Sx( 0)  -  S*X(T) 

rT 

X  /  S*A2(u)dtt 


P{X  <  Z}  -  P{X  >rnx  <Z}  = 

-  [TdS*x(u)  = 

Jo 

X  fT(S*x  +  S*z){u)du. 

JO 


Since  this  holds  for  all  T  >  0, 


dS*x(u) 

Sx(u)  +  Sz(u) 


The  proof  is  concluded  by  substituting  this  into  (2).  The  converse  is  an 
easy  computation. □ 

It  is  interesting  to  remark  that  Theorem  4  is  a  derivation  of  TTT  which 
does  not  appeal  to  the  maximum  likelihood  estimate  of  X.  Indeed,  put  T  =  oo 
in  (12),  then  TTT  consistently  estimates  1/A. 

Let  Sx  and  Sz  be  strictly  monotone,  so  that  their  derivatives  exist  ev¬ 
erywhere.  Recall  that  the  failure  rate  of  X  =  —  (d/dt)(\ogSx)-  We  say  that 
Z  is  IFR  if  its  failure  rate  is  increasing,  and  DFR  if  it  is  decreasing  ([2]). 


Theorem  5  Let  XLZ  with  subsurvival  functions  Sx  and  Sz  respectively. 
Let  Sx(t)  =  exp-At,  and  let  Sz(t)  be  continuous  and  strictly  monotone.  If  Z 
is  DFR  then 


>  s*z(f) . 
S\i  0)  s-z(oy 


t  >  o. 


Similarly,  if  Z  is  IFR  then: 


.  sz{t)  _ 
Sx(  0)  Siioy 


t  >  0. 
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Proof:  We  prove  only  the  first  statement,  as  the  proof  of  the  second  is 
similar.  Let  Gz(t)  =  Sz(t)/Sz( 0),  and  Gx{t)  —  5^(t)/5^-(0),  and  put 
gz(t)  =  dGz(t)/dt,  gx(t)  =  dGx{t)/dt.  The  assumptions  on  X  and  Z 
insure  that  these  derivatives  exist  everywhere.  Since  the  failure  rate  of  X  is 
constant,  we  have  from  (2): 


dSx(t ) 

Sx(t)  +  S*z(t) 


From  (3)  we  may  write  the  failure  rate  of  Z  as 


dSz( 0  _  \  dSz{t) 

S'x(t)  +  S*z(t)  ~  dS-x(ty 


(13) 


The  failure  rate  of  Z  is  decreasing  if  and  only  if  gz(t)/gx(t)  is  decreasing,  or 


n  .  gz{t)gx{t)  -  9x{t)gz(t) 
gxW 


(14) 


where  gz(t)  =  dgz{t)/dt,  gx(t)  =  dgx(t)/dt.  Since  gz( 0)  =  gx( 0)  =  1,  we 
have  3z(0)  <  gx( 0)  and  for  t  sufficiently  close  to  0,  we  have  gz(t)  <  gx(t). 

The  set  A  —  {t  >  0|^v(f)  =9z{t)}  is  closed,  and  has  a  smallest  element  if  A 
is  non-empty.  Suppose  A  is  non-empty  and  that  f0  is  its  smallest  element. 
Since  gz(t)  <  gx(t)  for  t  <  t0,  we  must  have  gz(to)  >  gx(yo)-  However,  since 
gz{t o)  =  gx{to),  wo  must  have  from  (14)  gz(to)  <  gx{to)-  Hence  A  is  empty. 

□. 

4  Conclusions 

Supposed  has  an  exponential  distribution  and  that  Sz(f)/S2(0)  <  Sx(t)/Sx(  0). 
The  data  can  be  described  either  by  an  age-dependent  censoring  model,  or 
by  a  random  censoring  model  with  a  DFR  censoring  variable.  The  conse¬ 
quences  of  making  the  wrong  choice  in  estimating  the  expectation  of  A'  can 
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be  quite  severe.  Suppose  we  order  the  failure  and  censored  observations  as 
x\,  ■  ■  ■ ,  xm,  z\,  •  •  •  zn.  If  we  assume  X  J_  Z  and  Sx  =  exp_A<,  then  by  Theorem 
4  we  could  consistently  estimate  the  expectation  of  X  as 


S™  1  ",  +  g?  1 


m 


However,  if  we  assume  an  age-dependent  censoring  model  then  P{X  =  x\X  < 
Z}  =  P{Y  =  x},  hence  we  could  consistently  estaimate  the  expectation  of 


X  as 


ES=i  Xi 


m 


In  typical  reliability  applications  m  «  n. 
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